Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 14259 |
| Missing cells | 3707 |
| Missing cells (%) | 1.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.0 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Categorical | 6 |
|---|---|
| Numeric | 6 |
| Text | 4 |
| DateTime | 1 |
parent_region has constant value "" | Constant |
light is highly imbalanced (85.5%) | Imbalance |
weather_conditions has 3027 (21.2%) missing values | Missing |
address has 660 (4.6%) missing values | Missing |
temperature has 277 (1.9%) zeros | Zeros |
wind_speed has 2925 (20.5%) zeros | Zeros |
cloudiness has 2918 (20.5%) zeros | Zeros |
Reproduction
| Analysis started | 2024-01-10 15:17:16.644133 |
|---|---|
| Analysis finished | 2024-01-10 15:17:23.866074 |
| Duration | 7.22 seconds |
| Software version | ydata-profiling vv4.6.2 |
| Download configuration | config.json |
year
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 222.8 KiB |
| 2019.0 | |
|---|---|
| 2020.0 | |
| 2018.0 | |
| 2021.0 | |
| 2017.0 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 85548 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021.0 |
|---|---|
| 2nd row | 2021.0 |
| 3rd row | 2021.0 |
| 4th row | 2021.0 |
| 5th row | 2020.0 |
Common Values
| Value | Count | Frequency (%) |
| 2019.0 | 3529 | |
| 2020.0 | 3055 | |
| 2018.0 | 2684 | |
| 2021.0 | 2648 | |
| 2017.0 | 2342 | |
| (Missing) | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2019.0 | 3529 | |
| 2020.0 | 3055 | |
| 2018.0 | 2684 | |
| 2021.0 | 2648 | |
| 2017.0 | 2342 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 31571 | |
| 2 | 19961 | |
| . | 14258 | |
| 1 | 11203 | 13.1% |
| 9 | 3529 | 4.1% |
| 8 | 2684 | 3.1% |
| 7 | 2342 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 71290 | |
| Other Punctuation | 14258 | 16.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 31571 | |
| 2 | 19961 | |
| 1 | 11203 | 15.7% |
| 9 | 3529 | 5.0% |
| 8 | 2684 | 3.8% |
| 7 | 2342 | 3.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 14258 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 85548 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 31571 | |
| 2 | 19961 | |
| . | 14258 | |
| 1 | 11203 | 13.1% |
| 9 | 3529 | 4.1% |
| 8 | 2684 | 3.1% |
| 7 | 2342 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85548 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 31571 | |
| 2 | 19961 | |
| . | 14258 | |
| 1 | 11203 | 13.1% |
| 9 | 3529 | 4.1% |
| 8 | 2684 | 3.1% |
| 7 | 2342 | 2.7% |
month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.1795483 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 222.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 8 |
| Q3 | 11 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 3.8530957 |
|---|---|
| Coefficient of variation (CV) | 0.53667661 |
| Kurtosis | -1.3970412 |
| Mean | 7.1795483 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.32884043 |
| Sum | 102366 |
| Variance | 14.846346 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 2073 | |
| 12 | 1817 | |
| 10 | 1736 | |
| 1 | 1506 | |
| 9 | 1305 | |
| 2 | 1200 | |
| 3 | 1105 | |
| 8 | 904 | |
| 4 | 760 | 5.3% |
| 7 | 671 | 4.7% |
| Other values (2) | 1181 |
| Value | Count | Frequency (%) |
| 1 | 1506 | |
| 2 | 1200 | |
| 3 | 1105 | |
| 4 | 760 | |
| 5 | 622 | 4.4% |
| 6 | 559 | 3.9% |
| 7 | 671 | 4.7% |
| 8 | 904 | |
| 9 | 1305 | |
| 10 | 1736 |
| Value | Count | Frequency (%) |
| 12 | 1817 | |
| 11 | 2073 | |
| 10 | 1736 | |
| 9 | 1305 | |
| 8 | 904 | |
| 7 | 671 | 4.7% |
| 6 | 559 | 3.9% |
| 5 | 622 | 4.4% |
| 4 | 760 | 5.3% |
| 3 | 1105 |
temperature
Real number (ℝ)
ZEROS 
| Distinct | 500 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.7320191 |
| Minimum | -27.7 |
|---|---|
| Maximum | 32 |
| Zeros | 277 |
| Zeros (%) | 1.9% |
| Negative | 4071 |
| Negative (%) | 28.6% |
| Memory size | 222.8 KiB |
Quantile statistics
| Minimum | -27.7 |
|---|---|
| 5-th percentile | -9 |
| Q1 | -1 |
| median | 3.1 |
| Q3 | 11.375 |
| 95-th percentile | 19.9 |
| Maximum | 32 |
| Range | 59.7 |
| Interquartile range (IQR) | 12.375 |
Descriptive statistics
| Standard deviation | 8.911348 |
|---|---|
| Coefficient of variation (CV) | 1.883202 |
| Kurtosis | -0.23500387 |
| Mean | 4.7320191 |
| Median Absolute Deviation (MAD) | 5.7 |
| Skewness | 0.12261362 |
| Sum | 67450.2 |
| Variance | 79.412123 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 403 | 2.8% |
| 1 | 334 | 2.3% |
| 0 | 277 | 1.9% |
| 3 | 256 | 1.8% |
| -2 | 233 | 1.6% |
| -1 | 230 | 1.6% |
| 5 | 208 | 1.5% |
| 9 | 179 | 1.3% |
| 4 | 160 | 1.1% |
| 7 | 153 | 1.1% |
| Other values (490) | 11821 |
| Value | Count | Frequency (%) |
| -27.7 | 1 | |
| -27.4 | 1 | |
| -27.2 | 1 | |
| -26.8 | 1 | |
| -26.7 | 1 | |
| -26.4 | 1 | |
| -26.3 | 1 | |
| -26.2 | 1 | |
| -25.3 | 1 | |
| -25 | 2 |
| Value | Count | Frequency (%) |
| 32 | 1 | < 0.1% |
| 31.2 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30.4 | 2 | < 0.1% |
| 30 | 1 | < 0.1% |
| 29.9 | 1 | < 0.1% |
| 29.7 | 2 | < 0.1% |
| 29 | 5 | |
| 28.8 | 1 | < 0.1% |
| 28.5 | 2 | < 0.1% |
atmospheric_pressure
Real number (ℝ)
| Distinct | 486 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 761.73237 |
| Minimum | 724.7 |
|---|---|
| Maximum | 787.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 222.8 KiB |
Quantile statistics
| Minimum | 724.7 |
|---|---|
| 5-th percentile | 748.5 |
| Q1 | 756.7 |
| median | 761.5 |
| Q3 | 766.8 |
| 95-th percentile | 775.5 |
| Maximum | 787.4 |
| Range | 62.7 |
| Interquartile range (IQR) | 10.1 |
Descriptive statistics
| Standard deviation | 8.1638682 |
|---|---|
| Coefficient of variation (CV) | 0.010717502 |
| Kurtosis | 0.47223358 |
| Mean | 761.73237 |
| Median Absolute Deviation (MAD) | 5.1 |
| Skewness | 0.0041896556 |
| Sum | 10860780 |
| Variance | 66.648743 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 758.2 | 255 | 1.8% |
| 759.7 | 237 | 1.7% |
| 762.8 | 233 | 1.6% |
| 761.2 | 225 | 1.6% |
| 759 | 222 | 1.6% |
| 763.5 | 215 | 1.5% |
| 765 | 202 | 1.4% |
| 757.7 | 196 | 1.4% |
| 766.6 | 191 | 1.3% |
| 764.3 | 187 | 1.3% |
| Other values (476) | 12095 |
| Value | Count | Frequency (%) |
| 724.7 | 1 | < 0.1% |
| 725.4 | 1 | < 0.1% |
| 725.9 | 2 | |
| 729 | 1 | < 0.1% |
| 729.7 | 2 | |
| 730 | 2 | |
| 730.4 | 1 | < 0.1% |
| 730.5 | 3 | |
| 733.8 | 1 | < 0.1% |
| 735.1 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 787.4 | 1 | < 0.1% |
| 786.9 | 2 | |
| 786.8 | 1 | < 0.1% |
| 786.7 | 3 | |
| 786.6 | 1 | < 0.1% |
| 786.1 | 3 | |
| 785.8 | 1 | < 0.1% |
| 785.6 | 1 | < 0.1% |
| 785.5 | 1 | < 0.1% |
| 785.4 | 2 |
humidity
Real number (ℝ)
| Distinct | 81 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79.034659 |
| Minimum | 19 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 222.8 KiB |
Quantile statistics
| Minimum | 19 |
|---|---|
| 5-th percentile | 48 |
| Q1 | 71 |
| median | 83 |
| Q3 | 91 |
| 95-th percentile | 97 |
| Maximum | 100 |
| Range | 81 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 15.288426 |
|---|---|
| Coefficient of variation (CV) | 0.19343951 |
| Kurtosis | 0.6910522 |
| Mean | 79.034659 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -1.0608264 |
| Sum | 1126481 |
| Variance | 233.73596 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 93 | 1044 | 7.3% |
| 87 | 722 | 5.1% |
| 86 | 604 | 4.2% |
| 94 | 529 | 3.7% |
| 88 | 497 | 3.5% |
| 100 | 468 | 3.3% |
| 92 | 427 | 3.0% |
| 81 | 412 | 2.9% |
| 90 | 398 | 2.8% |
| 80 | 396 | 2.8% |
| Other values (71) | 8756 |
| Value | Count | Frequency (%) |
| 19 | 2 | < 0.1% |
| 20 | 1 | < 0.1% |
| 22 | 3 | < 0.1% |
| 23 | 4 | < 0.1% |
| 24 | 2 | < 0.1% |
| 25 | 8 | |
| 26 | 5 | < 0.1% |
| 27 | 8 | |
| 28 | 16 | |
| 29 | 9 |
| Value | Count | Frequency (%) |
| 100 | 468 | |
| 99 | 20 | 0.1% |
| 98 | 54 | 0.4% |
| 97 | 201 | 1.4% |
| 96 | 263 | 1.8% |
| 95 | 317 | 2.2% |
| 94 | 529 | |
| 93 | 1044 | |
| 92 | 427 | |
| 91 | 363 | 2.5% |
direction_of_the_wind
Categorical
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 222.8 KiB |
| Штиль, безветрие | |
|---|---|
| Ю | |
| ЮЮВ | |
| З | |
| ЮВ | |
| Other values (13) |
Length
| Max length | 22 |
|---|---|
| Median length | 16 |
| Mean length | 5.1218965 |
| Min length | 1 |
Characters and Unicode
| Total characters | 73028 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | В |
|---|---|
| 2nd row | В |
| 3rd row | Штиль, безветрие |
| 4th row | ССЗ |
| 5th row | Штиль, безветрие |
Common Values
| Value | Count | Frequency (%) |
| Штиль, безветрие | 2928 | |
| Ю | 1335 | |
| ЮЮВ | 1249 | |
| З | 1230 | 8.6% |
| ЮВ | 936 | 6.6% |
| ЗЮЗ | 880 | 6.2% |
| ЗСЗ | 693 | 4.9% |
| ЮЗ | 659 | 4.6% |
| ЮЮЗ | 631 | 4.4% |
| СЗ | 628 | 4.4% |
| Other values (8) | 3089 |
Length
| Value | Count | Frequency (%) |
| штиль | 2928 | |
| безветрие | 2928 | |
| ю | 1335 | 7.7% |
| ююв | 1249 | 7.2% |
| з | 1230 | 7.1% |
| юв | 936 | 5.4% |
| зюз | 880 | 5.1% |
| зсз | 693 | 4.0% |
| юз | 659 | 3.8% |
| ююз | 631 | 3.6% |
| Other values (10) | 3825 |
Most occurring characters
| Value | Count | Frequency (%) |
| е | 9432 | |
| Ю | 8077 | |
| З | 6846 | 9.4% |
| и | 5964 | 8.2% |
| т | 5856 | 8.0% |
| В | 4839 | 6.6% |
| С | 4042 | 5.5% |
| р | 3144 | 4.3% |
| л | 3036 | 4.2% |
| 3036 | 4.2% | |
| Other values (12) | 18756 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40224 | |
| Uppercase Letter | 26840 | |
| Space Separator | 3036 | 4.2% |
| Other Punctuation | 2928 | 4.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| е | 9432 | |
| и | 5964 | |
| т | 5856 | |
| р | 3144 | 7.8% |
| л | 3036 | 7.5% |
| в | 3036 | 7.5% |
| з | 2928 | 7.3% |
| б | 2928 | 7.3% |
| ь | 2928 | 7.3% |
| н | 432 | 1.1% |
| Other values (4) | 540 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Ю | 8077 | |
| З | 6846 | |
| В | 4839 | |
| С | 4042 | |
| Ш | 2928 | 10.9% |
| П | 108 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 3036 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2928 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 67064 | |
| Common | 5964 | 8.2% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| е | 9432 | |
| Ю | 8077 | |
| З | 6846 | |
| и | 5964 | |
| т | 5856 | |
| В | 4839 | 7.2% |
| С | 4042 | 6.0% |
| р | 3144 | 4.7% |
| л | 3036 | 4.5% |
| в | 3036 | 4.5% |
| Other values (10) | 12792 |
Common
| Value | Count | Frequency (%) |
| 3036 | ||
| , | 2928 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 67064 | |
| ASCII | 5964 | 8.2% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| е | 9432 | |
| Ю | 8077 | |
| З | 6846 | |
| и | 5964 | |
| т | 5856 | |
| В | 4839 | 7.2% |
| С | 4042 | 6.0% |
| р | 3144 | 4.7% |
| л | 3036 | 4.5% |
| в | 3036 | 4.5% |
| Other values (10) | 12792 |
ASCII
| Value | Count | Frequency (%) |
| 3036 | ||
| , | 2928 |
wind_speed
Real number (ℝ)
ZEROS 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8254647 |
| Minimum | 0 |
|---|---|
| Maximum | 13 |
| Zeros | 2925 |
| Zeros (%) | 20.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 222.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.7269348 |
|---|---|
| Coefficient of variation (CV) | 0.94602471 |
| Kurtosis | 2.5706051 |
| Mean | 1.8254647 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.4576878 |
| Sum | 26022 |
| Variance | 2.9823037 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 4807 | |
| 0 | 2925 | |
| 2 | 2900 | |
| 3 | 1510 | 10.6% |
| 4 | 902 | 6.3% |
| 5 | 535 | 3.8% |
| 6 | 361 | 2.5% |
| 7 | 178 | 1.2% |
| 8 | 79 | 0.6% |
| 9 | 40 | 0.3% |
| Other values (4) | 18 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2925 | |
| 1 | 4807 | |
| 2 | 2900 | |
| 3 | 1510 | 10.6% |
| 4 | 902 | 6.3% |
| 5 | 535 | 3.8% |
| 6 | 361 | 2.5% |
| 7 | 178 | 1.2% |
| 8 | 79 | 0.6% |
| 9 | 40 | 0.3% |
| Value | Count | Frequency (%) |
| 13 | 1 | < 0.1% |
| 12 | 5 | < 0.1% |
| 11 | 5 | < 0.1% |
| 10 | 7 | < 0.1% |
| 9 | 40 | 0.3% |
| 8 | 79 | 0.6% |
| 7 | 178 | 1.2% |
| 6 | 361 | |
| 5 | 535 | |
| 4 | 902 |
cloudiness
Real number (ℝ)
ZEROS 
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.68522233 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 2918 |
| Zeros (%) | 20.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 222.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.4 |
| median | 0.95 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.6 |
Descriptive statistics
| Standard deviation | 0.40145465 |
|---|---|
| Coefficient of variation (CV) | 0.58587503 |
| Kurtosis | -0.97762388 |
| Mean | 0.68522233 |
| Median Absolute Deviation (MAD) | 0.05 |
| Skewness | -0.85121633 |
| Sum | 9769.9 |
| Variance | 0.16116584 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 6786 | |
| 0 | 2918 | |
| 0.75 | 1867 | 13.1% |
| 0.95 | 929 | 6.5% |
| 0.25 | 429 | 3.0% |
| 0.45 | 425 | 3.0% |
| 0.6 | 364 | 2.6% |
| 0.4 | 260 | 1.8% |
| 0.5 | 111 | 0.8% |
| 0.2 | 104 | 0.7% |
| Other values (2) | 65 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 2918 | |
| 0.05 | 52 | 0.4% |
| 0.1 | 13 | 0.1% |
| 0.2 | 104 | 0.7% |
| 0.25 | 429 | 3.0% |
| 0.4 | 260 | 1.8% |
| 0.45 | 425 | 3.0% |
| 0.5 | 111 | 0.8% |
| 0.6 | 364 | 2.6% |
| 0.75 | 1867 |
| Value | Count | Frequency (%) |
| 1 | 6786 | |
| 0.95 | 929 | 6.5% |
| 0.75 | 1867 | 13.1% |
| 0.6 | 364 | 2.6% |
| 0.5 | 111 | 0.8% |
| 0.45 | 425 | 3.0% |
| 0.4 | 260 | 1.8% |
| 0.25 | 429 | 3.0% |
| 0.2 | 104 | 0.7% |
| 0.1 | 13 | 0.1% |
MISSING 
| Distinct | 94 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 3027 |
| Missing (%) | 21.2% |
| Memory size | 222.8 KiB |
Length
| Max length | 132 |
|---|---|
| Median length | 111 |
| Mean length | 17.002582 |
| Min length | 1 |
Characters and Unicode
| Total characters | 190973 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 21 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Состояние неба в общем не изменилось. |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row | Дымка. |
| Value | Count | Frequency (%) |
| в | 2532 | 10.2% |
| дымка | 2109 | 8.5% |
| срок | 1843 | 7.4% |
| наблюдения | 1843 | 7.4% |
| снег | 1633 | 6.6% |
| слабый | 1275 | 5.1% |
| непрерывный | 1142 | 4.6% |
| дождь | 1045 | 4.2% |
| слабый(ая)(ые | 829 | 3.3% |
| или | 827 | 3.3% |
| Other values (114) | 9793 |
Most occurring characters
| Value | Count | Frequency (%) |
| 28978 | ||
| е | 15785 | 8.3% |
| н | 14231 | 7.5% |
| а | 11689 | 6.1% |
| ы | 10271 | 5.4% |
| и | 9365 | 4.9% |
| л | 7444 | 3.9% |
| с | 7396 | 3.9% |
| о | 7386 | 3.9% |
| в | 6663 | 3.5% |
| Other values (41) | 71765 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 143295 | |
| Space Separator | 28978 | 15.2% |
| Uppercase Letter | 6035 | 3.2% |
| Other Punctuation | 5335 | 2.8% |
| Close Punctuation | 3650 | 1.9% |
| Open Punctuation | 3650 | 1.9% |
| Decimal Number | 30 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| е | 15785 | 11.0% |
| н | 14231 | 9.9% |
| а | 11689 | 8.2% |
| ы | 10271 | 7.2% |
| и | 9365 | 6.5% |
| л | 7444 | 5.2% |
| с | 7396 | 5.2% |
| о | 7386 | 5.2% |
| в | 6663 | 4.6% |
| й | 6164 | 4.3% |
| Other values (20) | 46901 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Д | 2678 | |
| С | 2319 | |
| Л | 797 | 13.2% |
| М | 86 | 1.4% |
| О | 67 | 1.1% |
| Г | 33 | 0.5% |
| Т | 31 | 0.5% |
| З | 22 | 0.4% |
| Ч | 1 | < 0.1% |
| В | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 13 | |
| 1 | 7 | |
| 2 | 4 | 13.3% |
| 4 | 3 | 10.0% |
| 5 | 3 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4916 | |
| , | 405 | 7.6% |
| / | 14 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 28978 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3650 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3650 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 149330 | |
| Common | 41643 | 21.8% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| е | 15785 | 10.6% |
| н | 14231 | 9.5% |
| а | 11689 | 7.8% |
| ы | 10271 | 6.9% |
| и | 9365 | 6.3% |
| л | 7444 | 5.0% |
| с | 7396 | 5.0% |
| о | 7386 | 4.9% |
| в | 6663 | 4.5% |
| й | 6164 | 4.1% |
| Other values (30) | 52936 |
Common
| Value | Count | Frequency (%) |
| 28978 | ||
| . | 4916 | 11.8% |
| ) | 3650 | 8.8% |
| ( | 3650 | 8.8% |
| , | 405 | 1.0% |
| / | 14 | < 0.1% |
| 3 | 13 | < 0.1% |
| 1 | 7 | < 0.1% |
| 2 | 4 | < 0.1% |
| 4 | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 149330 | |
| ASCII | 41643 | 21.8% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 28978 | ||
| . | 4916 | 11.8% |
| ) | 3650 | 8.8% |
| ( | 3650 | 8.8% |
| , | 405 | 1.0% |
| / | 14 | < 0.1% |
| 3 | 13 | < 0.1% |
| 1 | 7 | < 0.1% |
| 2 | 4 | < 0.1% |
| 4 | 3 | < 0.1% |
Cyrillic
| Value | Count | Frequency (%) |
| е | 15785 | 10.6% |
| н | 14231 | 9.5% |
| а | 11689 | 7.8% |
| ы | 10271 | 6.9% |
| и | 9365 | 6.3% |
| л | 7444 | 5.0% |
| с | 7396 | 5.0% |
| о | 7386 | 4.9% |
| в | 6663 | 4.5% |
| й | 6164 | 4.1% |
| Other values (30) | 52936 |
light
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 222.8 KiB |
| В темное время суток, освещение включено | |
|---|---|
| Сумерки | 483 |
| В темное время суток, освещение не включено | 59 |
| В темное время суток, освещение отсутствует | 59 |
Length
| Max length | 43 |
|---|---|
| Median length | 40 |
| Mean length | 38.907006 |
| Min length | 7 |
Characters and Unicode
| Total characters | 554775 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | В темное время суток, освещение включено |
|---|---|
| 2nd row | В темное время суток, освещение включено |
| 3rd row | В темное время суток, освещение включено |
| 4th row | В темное время суток, освещение не включено |
| 5th row | В темное время суток, освещение включено |
Common Values
| Value | Count | Frequency (%) |
| В темное время суток, освещение включено | 13658 | |
| Сумерки | 483 | 3.4% |
| В темное время суток, освещение не включено | 59 | 0.4% |
| В темное время суток, освещение отсутствует | 59 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| в | 13776 | |
| темное | 13776 | |
| время | 13776 | |
| суток | 13776 | |
| освещение | 13776 | |
| включено | 13717 | |
| сумерки | 483 | 0.6% |
| не | 59 | 0.1% |
| отсутствует | 59 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| е | 96974 | |
| 68939 | ||
| о | 55104 | |
| н | 41328 | 7.4% |
| в | 41328 | 7.4% |
| м | 28035 | 5.1% |
| к | 27976 | 5.0% |
| т | 27788 | 5.0% |
| с | 27670 | 5.0% |
| у | 14377 | 2.6% |
| Other values (10) | 125256 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 457801 | |
| Space Separator | 68939 | 12.4% |
| Uppercase Letter | 14259 | 2.6% |
| Other Punctuation | 13776 | 2.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| е | 96974 | |
| о | 55104 | |
| н | 41328 | |
| в | 41328 | |
| м | 28035 | 6.1% |
| к | 27976 | 6.1% |
| т | 27788 | 6.1% |
| с | 27670 | 6.0% |
| у | 14377 | 3.1% |
| р | 14259 | 3.1% |
| Other values (6) | 82962 |
Uppercase Letter
| Value | Count | Frequency (%) |
| В | 13776 | |
| С | 483 | 3.4% |
Space Separator
| Value | Count | Frequency (%) |
| 68939 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 13776 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 472060 | |
| Common | 82715 | 14.9% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| е | 96974 | |
| о | 55104 | |
| н | 41328 | |
| в | 41328 | |
| м | 28035 | 5.9% |
| к | 27976 | 5.9% |
| т | 27788 | 5.9% |
| с | 27670 | 5.9% |
| у | 14377 | 3.0% |
| р | 14259 | 3.0% |
| Other values (8) | 97221 |
Common
| Value | Count | Frequency (%) |
| 68939 | ||
| , | 13776 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 472060 | |
| ASCII | 82715 | 14.9% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| е | 96974 | |
| о | 55104 | |
| н | 41328 | |
| в | 41328 | |
| м | 28035 | 5.9% |
| к | 27976 | 5.9% |
| т | 27788 | 5.9% |
| с | 27670 | 5.9% |
| у | 14377 | 3.0% |
| р | 14259 | 3.0% |
| Other values (8) | 97221 |
ASCII
| Value | Count | Frequency (%) |
| 68939 | ||
| , | 13776 | 16.7% |
point
Text
| Distinct | 13907 |
|---|---|
| Distinct (%) | 97.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 222.8 KiB |
Length
| Max length | 37 |
|---|---|
| Median length | 37 |
| Mean length | 36.322673 |
| Min length | 27 |
Characters and Unicode
| Total characters | 517925 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 13782 ? |
|---|---|
| Unique (%) | 96.7% |
Sample
| 1st row | {'lat': 55.667499, 'long': 37.770245} |
|---|---|
| 2nd row | {'lat': 55.669411, 'long': 37.553995} |
| 3rd row | {'lat': 55.7174, 'long': 37.568661} |
| 4th row | {'lat': 55.447467, 'long': 37.133403} |
| 5th row | {'lat': 55.844982, 'long': 37.424455} |
| Value | Count | Frequency (%) |
| lat | 14259 | |
| long | 14259 | |
| 55.0 | 17 | < 0.1% |
| 37.0 | 16 | < 0.1% |
| 37.839 | 14 | < 0.1% |
| 55.864 | 13 | < 0.1% |
| 55.893 | 13 | < 0.1% |
| 55.883 | 12 | < 0.1% |
| 55.875 | 12 | < 0.1% |
| 55.874 | 12 | < 0.1% |
| Other values (24585) | 28409 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 57036 | 11.0% |
| 5 | 46878 | 9.1% |
| 42777 | 8.3% | |
| 7 | 35363 | 6.8% |
| 3 | 28923 | 5.6% |
| l | 28518 | 5.5% |
| : | 28518 | 5.5% |
| . | 28516 | 5.5% |
| 6 | 20049 | 3.9% |
| 8 | 19168 | 3.7% |
| Other values (15) | 182179 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 218480 | |
| Other Punctuation | 128329 | |
| Lowercase Letter | 99819 | |
| Space Separator | 42777 | 8.3% |
| Close Punctuation | 14259 | 2.8% |
| Open Punctuation | 14259 | 2.8% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 46878 | |
| 7 | 35363 | |
| 3 | 28923 | |
| 6 | 20049 | |
| 8 | 19168 | |
| 4 | 16605 | 7.6% |
| 9 | 13881 | 6.4% |
| 1 | 13599 | 6.2% |
| 2 | 13593 | 6.2% |
| 0 | 10421 | 4.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 28518 | |
| n | 14261 | |
| o | 14261 | |
| g | 14259 | |
| t | 14259 | |
| a | 14259 | |
| e | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 57036 | |
| : | 28518 | |
| . | 28516 | |
| , | 14259 | 11.1% |
Space Separator
| Value | Count | Frequency (%) |
| 42777 |
Close Punctuation
| Value | Count | Frequency (%) |
| } | 14259 |
Open Punctuation
| Value | Count | Frequency (%) |
| { | 14259 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 418104 | |
| Latin | 99821 | 19.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| ' | 57036 | |
| 5 | 46878 | |
| 42777 | ||
| 7 | 35363 | 8.5% |
| 3 | 28923 | 6.9% |
| : | 28518 | 6.8% |
| . | 28516 | 6.8% |
| 6 | 20049 | 4.8% |
| 8 | 19168 | 4.6% |
| 4 | 16605 | 4.0% |
| Other values (7) | 94271 |
Latin
| Value | Count | Frequency (%) |
| l | 28518 | |
| n | 14261 | |
| o | 14261 | |
| g | 14259 | |
| t | 14259 | |
| a | 14259 | |
| N | 2 | < 0.1% |
| e | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 517925 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 57036 | 11.0% |
| 5 | 46878 | 9.1% |
| 42777 | 8.3% | |
| 7 | 35363 | 6.8% |
| 3 | 28923 | 5.6% |
| l | 28518 | 5.5% |
| : | 28518 | 5.5% |
| . | 28516 | 5.5% |
| 6 | 20049 | 3.9% |
| 8 | 19168 | 3.7% |
| Other values (15) | 182179 |
pogoda_region
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 222.8 KiB |
| Восток | |
|---|---|
| Юг | |
| Запад | |
| Север | |
| Северо-восток | |
| Other values (4) |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 7.0902588 |
| Min length | 2 |
Characters and Unicode
| Total characters | 101100 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Юго-восток |
|---|---|
| 2nd row | Юго-запад |
| 3rd row | Центр |
| 4th row | Северо-запад |
| 5th row | Северо-запад |
Common Values
| Value | Count | Frequency (%) |
| Восток | 2086 | |
| Юг | 1903 | |
| Запад | 1888 | |
| Север | 1795 | |
| Северо-восток | 1764 | |
| Северо-запад | 1313 | |
| Центр | 1267 | |
| Юго-восток | 1153 | |
| Юго-запад | 1090 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| восток | 2086 | |
| юг | 1903 | |
| запад | 1888 | |
| север | 1795 | |
| северо-восток | 1764 | |
| северо-запад | 1313 | |
| центр | 1267 | |
| юго-восток | 1153 | |
| юго-запад | 1090 |
Most occurring characters
| Value | Count | Frequency (%) |
| о | 15326 | |
| е | 11011 | |
| а | 8582 | 8.5% |
| в | 7789 | 7.7% |
| т | 6270 | 6.2% |
| р | 6139 | 6.1% |
| - | 5320 | 5.3% |
| с | 5003 | 4.9% |
| к | 5003 | 4.9% |
| С | 4872 | 4.8% |
| Other values (9) | 25785 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 81521 | |
| Uppercase Letter | 14259 | 14.1% |
| Dash Punctuation | 5320 | 5.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| о | 15326 | |
| е | 11011 | |
| а | 8582 | |
| в | 7789 | |
| т | 6270 | |
| р | 6139 | |
| с | 5003 | 6.1% |
| к | 5003 | 6.1% |
| п | 4291 | 5.3% |
| д | 4291 | 5.3% |
| Other values (3) | 7816 |
Uppercase Letter
| Value | Count | Frequency (%) |
| С | 4872 | |
| Ю | 4146 | |
| В | 2086 | |
| З | 1888 | 13.2% |
| Ц | 1267 | 8.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5320 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 95780 | |
| Common | 5320 | 5.3% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| о | 15326 | |
| е | 11011 | |
| а | 8582 | 9.0% |
| в | 7789 | 8.1% |
| т | 6270 | 6.5% |
| р | 6139 | 6.4% |
| с | 5003 | 5.2% |
| к | 5003 | 5.2% |
| С | 4872 | 5.1% |
| п | 4291 | 4.5% |
| Other values (8) | 21494 |
Common
| Value | Count | Frequency (%) |
| - | 5320 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 95780 | |
| ASCII | 5320 | 5.3% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| о | 15326 | |
| е | 11011 | |
| а | 8582 | 9.0% |
| в | 7789 | 8.1% |
| т | 6270 | 6.5% |
| р | 6139 | 6.4% |
| с | 5003 | 5.2% |
| к | 5003 | 5.2% |
| С | 4872 | 5.1% |
| п | 4291 | 4.5% |
| Other values (8) | 21494 |
ASCII
| Value | Count | Frequency (%) |
| - | 5320 |
region
Text
| Distinct | 121 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 222.8 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 19 |
| Mean length | 11.608388 |
| Min length | 5 |
Characters and Unicode
| Total characters | 165524 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Люблино |
|---|---|
| 2nd row | Черемушки |
| 3rd row | Хамовники |
| 4th row | Южное Тушино |
| 5th row | Южное Тушино |
| Value | Count | Frequency (%) |
| северное | 807 | 4.7% |
| южное | 762 | 4.4% |
| тушино | 402 | 2.3% |
| измайлово | 393 | 2.3% |
| чертаново | 382 | 2.2% |
| восточное | 351 | 2.0% |
| бирюлево | 300 | 1.8% |
| орехово-борисово | 277 | 1.6% |
| пресненский | 264 | 1.5% |
| гольяново | 251 | 1.5% |
| Other values (113) | 12945 |
Most occurring characters
| Value | Count | Frequency (%) |
| о | 26434 | |
| е | 14185 | 8.6% |
| и | 12643 | 7.6% |
| н | 12276 | 7.4% |
| в | 11005 | 6.6% |
| к | 9167 | 5.5% |
| р | 8392 | 5.1% |
| а | 7824 | 4.7% |
| с | 7242 | 4.4% |
| й | 5036 | 3.0% |
| Other values (48) | 51320 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 142313 | |
| Uppercase Letter | 18533 | 11.2% |
| Space Separator | 2875 | 1.7% |
| Dash Punctuation | 1803 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| о | 26434 | |
| е | 14185 | |
| и | 12643 | |
| н | 12276 | |
| в | 11005 | 7.7% |
| к | 9167 | 6.4% |
| р | 8392 | 5.9% |
| а | 7824 | 5.5% |
| с | 7242 | 5.1% |
| й | 5036 | 3.5% |
| Other values (21) | 28109 |
Uppercase Letter
| Value | Count | Frequency (%) |
| С | 2225 | 12.0% |
| М | 1718 | 9.3% |
| Б | 1667 | 9.0% |
| Т | 1225 | 6.6% |
| Д | 997 | 5.4% |
| П | 953 | 5.1% |
| К | 949 | 5.1% |
| В | 929 | 5.0% |
| Н | 909 | 4.9% |
| О | 858 | 4.6% |
| Other values (15) | 6103 |
Space Separator
| Value | Count | Frequency (%) |
| 2875 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1803 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 160846 | |
| Common | 4678 | 2.8% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| о | 26434 | |
| е | 14185 | 8.8% |
| и | 12643 | 7.9% |
| н | 12276 | 7.6% |
| в | 11005 | 6.8% |
| к | 9167 | 5.7% |
| р | 8392 | 5.2% |
| а | 7824 | 4.9% |
| с | 7242 | 4.5% |
| й | 5036 | 3.1% |
| Other values (46) | 46642 |
Common
| Value | Count | Frequency (%) |
| 2875 | ||
| - | 1803 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 160846 | |
| ASCII | 4678 | 2.8% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| о | 26434 | |
| е | 14185 | 8.8% |
| и | 12643 | 7.9% |
| н | 12276 | 7.6% |
| в | 11005 | 6.8% |
| к | 9167 | 5.7% |
| р | 8392 | 5.2% |
| а | 7824 | 4.9% |
| с | 7242 | 4.5% |
| й | 5036 | 3.1% |
| Other values (46) | 46642 |
ASCII
| Value | Count | Frequency (%) |
| 2875 | ||
| - | 1803 |
address
Text
MISSING 
| Distinct | 9301 |
|---|---|
| Distinct (%) | 68.4% |
| Missing | 660 |
| Missing (%) | 4.6% |
| Memory size | 222.8 KiB |
Length
| Max length | 108 |
|---|---|
| Median length | 89 |
| Mean length | 36.990808 |
| Min length | 19 |
Characters and Unicode
| Total characters | 503038 |
|---|---|
| Distinct characters | 84 |
| Distinct categories | 10 ? |
| Distinct scripts | 3 ? |
| Distinct blocks | 4 ? |
Unique
| Unique | 7344 ? |
|---|---|
| Unique (%) | 54.0% |
Sample
| 1st row | г Москва, ул Верхние Поля, 39 |
|---|---|
| 2nd row | г Москва, ул Профсоюзная, 56 |
| 3rd row | г Москва, пр-кт Комсомольский, 48 |
| 4th row | А-113 Центральная кольцевая автомобильная дорога (Московская область), 239 км |
| 5th row | г Москва, ул Туристская, 2 |
| Value | Count | Frequency (%) |
| москва | 13015 | 15.3% |
| г | 12989 | 15.2% |
| ул | 6861 | 8.1% |
| ш | 1938 | 2.3% |
| км | 1881 | 2.2% |
| мкад | 1713 | 2.0% |
| сторона | 1690 | 2.0% |
| дорога | 1624 | 1.9% |
| автомобильная | 1623 | 1.9% |
| кольцевая | 1623 | 1.9% |
| Other values (2783) | 40226 |
Most occurring characters
| Value | Count | Frequency (%) |
| 71587 | 14.2% | |
| о | 43420 | 8.6% |
| а | 37906 | 7.5% |
| к | 33357 | 6.6% |
| с | 29165 | 5.8% |
| , | 27744 | 5.5% |
| в | 27154 | 5.4% |
| М | 17868 | 3.6% |
| г | 16424 | 3.3% |
| р | 15204 | 3.0% |
| Other values (74) | 183209 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 328029 | |
| Space Separator | 71587 | 14.2% |
| Uppercase Letter | 37936 | 7.5% |
| Other Punctuation | 29662 | 5.9% |
| Decimal Number | 29350 | 5.8% |
| Dash Punctuation | 3006 | 0.6% |
| Open Punctuation | 1728 | 0.3% |
| Close Punctuation | 1728 | 0.3% |
| Other Symbol | 11 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| о | 43420 | |
| а | 37906 | |
| к | 33357 | |
| с | 29165 | 8.9% |
| в | 27154 | 8.3% |
| г | 16424 | 5.0% |
| р | 15204 | 4.6% |
| н | 15132 | 4.6% |
| л | 14842 | 4.5% |
| я | 14324 | 4.4% |
| Other values (25) | 81101 |
Uppercase Letter
| Value | Count | Frequency (%) |
| М | 17868 | |
| К | 3690 | 9.7% |
| А | 2807 | 7.4% |
| Д | 2297 | 6.1% |
| С | 1427 | 3.8% |
| Б | 1379 | 3.6% |
| В | 1271 | 3.4% |
| П | 1134 | 3.0% |
| Л | 938 | 2.5% |
| Н | 636 | 1.7% |
| Other values (19) | 4489 | 11.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7573 | |
| 2 | 4734 | |
| 3 | 3181 | |
| 4 | 2607 | 8.9% |
| 5 | 2237 | 7.6% |
| 6 | 2208 | 7.5% |
| 8 | 1781 | 6.1% |
| 7 | 1780 | 6.1% |
| 9 | 1626 | 5.5% |
| 0 | 1623 | 5.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 27744 | |
| . | 1898 | 6.4% |
| " | 20 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3005 | |
| – | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 71587 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1728 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1728 |
Other Symbol
| Value | Count | Frequency (%) |
| № | 11 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 365960 | |
| Common | 137073 | 27.2% |
| Latin | 5 | < 0.1% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| о | 43420 | 11.9% |
| а | 37906 | 10.4% |
| к | 33357 | 9.1% |
| с | 29165 | 8.0% |
| в | 27154 | 7.4% |
| М | 17868 | 4.9% |
| г | 16424 | 4.5% |
| р | 15204 | 4.2% |
| н | 15132 | 4.1% |
| л | 14842 | 4.1% |
| Other values (51) | 115488 |
Common
| Value | Count | Frequency (%) |
| 71587 | ||
| , | 27744 | 20.2% |
| 1 | 7573 | 5.5% |
| 2 | 4734 | 3.5% |
| 3 | 3181 | 2.3% |
| - | 3005 | 2.2% |
| 4 | 2607 | 1.9% |
| 5 | 2237 | 1.6% |
| 6 | 2208 | 1.6% |
| . | 1898 | 1.4% |
| Other values (10) | 10299 | 7.5% |
Latin
| Value | Count | Frequency (%) |
| c | 3 | |
| r | 1 | 20.0% |
| A | 1 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 365960 | |
| ASCII | 137066 | 27.2% |
| Letterlike Symbols | 11 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 71587 | ||
| , | 27744 | 20.2% |
| 1 | 7573 | 5.5% |
| 2 | 4734 | 3.5% |
| 3 | 3181 | 2.3% |
| - | 3005 | 2.2% |
| 4 | 2607 | 1.9% |
| 5 | 2237 | 1.6% |
| 6 | 2208 | 1.6% |
| . | 1898 | 1.4% |
| Other values (11) | 10292 | 7.5% |
Cyrillic
| Value | Count | Frequency (%) |
| о | 43420 | 11.9% |
| а | 37906 | 10.4% |
| к | 33357 | 9.1% |
| с | 29165 | 8.0% |
| в | 27154 | 7.4% |
| М | 17868 | 4.9% |
| г | 16424 | 4.5% |
| р | 15204 | 4.2% |
| н | 15132 | 4.1% |
| л | 14842 | 4.1% |
| Other values (51) | 115488 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| № | 11 |
Punctuation
| Value | Count | Frequency (%) |
| – | 1 |
datetime
Date
| Distinct | 13836 |
|---|---|
| Distinct (%) | 97.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 222.8 KiB |
| Minimum | 2016-12-31 22:50:00 |
|---|---|
| Maximum | 2021-11-30 23:02:00 |
severity
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 222.8 KiB |
| Легкий | |
|---|---|
| Тяжёлый | |
| С погибшими | 777 |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 6.4888141 |
| Min length | 6 |
Characters and Unicode
| Total characters | 92524 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Легкий |
|---|---|
| 2nd row | Легкий |
| 3rd row | Легкий |
| 4th row | Тяжёлый |
| 5th row | Легкий |
Common Values
| Value | Count | Frequency (%) |
| Легкий | 10397 | |
| Тяжёлый | 3085 | 21.6% |
| С погибшими | 777 | 5.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| легкий | 10397 | |
| тяжёлый | 3085 | 20.5% |
| с | 777 | 5.2% |
| погибшими | 777 | 5.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| й | 13482 | |
| и | 12728 | |
| г | 11174 | |
| Л | 10397 | |
| е | 10397 | |
| к | 10397 | |
| ы | 3085 | 3.3% |
| л | 3085 | 3.3% |
| ё | 3085 | 3.3% |
| ж | 3085 | 3.3% |
| Other values (9) | 11609 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 77488 | |
| Uppercase Letter | 14259 | 15.4% |
| Space Separator | 777 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| й | 13482 | |
| и | 12728 | |
| г | 11174 | |
| е | 10397 | |
| к | 10397 | |
| ы | 3085 | 4.0% |
| л | 3085 | 4.0% |
| ё | 3085 | 4.0% |
| ж | 3085 | 4.0% |
| я | 3085 | 4.0% |
| Other values (5) | 3885 | 5.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Л | 10397 | |
| Т | 3085 | 21.6% |
| С | 777 | 5.4% |
Space Separator
| Value | Count | Frequency (%) |
| 777 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 91747 | |
| Common | 777 | 0.8% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| й | 13482 | |
| и | 12728 | |
| г | 11174 | |
| Л | 10397 | |
| е | 10397 | |
| к | 10397 | |
| ы | 3085 | 3.4% |
| л | 3085 | 3.4% |
| ё | 3085 | 3.4% |
| ж | 3085 | 3.4% |
| Other values (8) | 10832 |
Common
| Value | Count | Frequency (%) |
| 777 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 91747 | |
| ASCII | 777 | 0.8% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| й | 13482 | |
| и | 12728 | |
| г | 11174 | |
| Л | 10397 | |
| е | 10397 | |
| к | 10397 | |
| ы | 3085 | 3.4% |
| л | 3085 | 3.4% |
| ё | 3085 | 3.4% |
| ж | 3085 | 3.4% |
| Other values (8) | 10832 |
ASCII
| Value | Count | Frequency (%) |
| 777 |
parent_region
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 222.8 KiB |
| Москва |
|---|
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 85554 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Москва |
|---|---|
| 2nd row | Москва |
| 3rd row | Москва |
| 4th row | Москва |
| 5th row | Москва |
Common Values
| Value | Count | Frequency (%) |
| Москва | 14259 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| москва | 14259 |
Most occurring characters
| Value | Count | Frequency (%) |
| М | 14259 | |
| о | 14259 | |
| с | 14259 | |
| к | 14259 | |
| в | 14259 | |
| а | 14259 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 71295 | |
| Uppercase Letter | 14259 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| о | 14259 | |
| с | 14259 | |
| к | 14259 | |
| в | 14259 | |
| а | 14259 |
Uppercase Letter
| Value | Count | Frequency (%) |
| М | 14259 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 85554 |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| М | 14259 | |
| о | 14259 | |
| с | 14259 | |
| к | 14259 | |
| в | 14259 | |
| а | 14259 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 85554 |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| М | 14259 | |
| о | 14259 | |
| с | 14259 | |
| к | 14259 | |
| в | 14259 | |
| а | 14259 |
| atmospheric_pressure | cloudiness | direction_of_the_wind | humidity | light | month | pogoda_region | severity | temperature | wind_speed | year | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| atmospheric_pressure | 1.000 | -0.172 | 0.097 | -0.210 | 0.000 | 0.142 | 0.028 | 0.013 | -0.139 | -0.144 | 0.126 |
| cloudiness | -0.172 | 1.000 | 0.107 | 0.399 | 0.014 | 0.127 | 0.203 | 0.044 | -0.280 | -0.015 | 0.097 |
| direction_of_the_wind | 0.097 | 0.107 | 1.000 | 0.047 | 0.010 | 0.073 | 0.182 | 0.030 | -0.049 | -0.030 | 0.092 |
| humidity | -0.210 | 0.399 | 0.047 | 1.000 | 0.038 | 0.187 | 0.082 | 0.000 | -0.233 | 0.032 | 0.066 |
| light | 0.000 | 0.014 | 0.010 | 0.038 | 1.000 | -0.010 | 0.053 | 0.035 | 0.062 | -0.002 | 0.015 |
| month | 0.142 | 0.127 | 0.073 | 0.187 | -0.010 | 1.000 | 0.024 | 0.033 | 0.057 | -0.002 | 0.093 |
| pogoda_region | 0.028 | 0.203 | 0.182 | 0.082 | 0.053 | 0.024 | 1.000 | 0.119 | 0.011 | 0.441 | 0.152 |
| severity | 0.013 | 0.044 | 0.030 | 0.000 | 0.035 | 0.033 | 0.119 | 1.000 | 0.031 | 0.061 | 0.066 |
| temperature | -0.139 | -0.280 | -0.049 | -0.233 | 0.062 | 0.057 | 0.011 | 0.031 | 1.000 | -0.188 | 0.142 |
| wind_speed | -0.144 | -0.015 | -0.030 | 0.032 | -0.002 | -0.002 | 0.441 | 0.061 | -0.188 | 1.000 | 0.082 |
| year | 0.126 | 0.097 | 0.092 | 0.066 | 0.015 | 0.093 | 0.152 | 0.066 | 0.142 | 0.082 | 1.000 |
| year | month | temperature | atmospheric_pressure | humidity | direction_of_the_wind | wind_speed | cloudiness | weather_conditions | light | point | pogoda_region | region | address | datetime | severity | parent_region | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2021.0 | 5.0 | 11.0 | 751.6 | 100.0 | В | 3.0 | 0.00 | NaN | В темное время суток, освещение включено | {'lat': 55.667499, 'long': 37.770245} | Юго-восток | Люблино | г Москва, ул Верхние Поля, 39 | 2021-05-21 00:35:00 | Легкий | Москва |
| 1 | 2021.0 | 5.0 | 17.0 | 756.9 | 68.0 | В | 4.0 | 0.20 | NaN | В темное время суток, освещение включено | {'lat': 55.669411, 'long': 37.553995} | Юго-запад | Черемушки | г Москва, ул Профсоюзная, 56 | 2021-05-14 22:30:00 | Легкий | Москва |
| 4 | 2021.0 | 7.0 | 23.5 | 758.5 | 68.0 | Штиль, безветрие | 0.0 | 0.95 | Состояние неба в общем не изменилось. | В темное время суток, освещение включено | {'lat': 55.7174, 'long': 37.568661} | Центр | Хамовники | г Москва, пр-кт Комсомольский, 48 | 2021-07-15 21:20:00 | Легкий | Москва |
| 11 | 2021.0 | 7.0 | 19.2 | 755.9 | 55.0 | ССЗ | 1.0 | 1.00 | В темное время суток, освещение не включено | {'lat': 55.447467, 'long': 37.133403} | Северо-запад | Южное Тушино | А-113 Центральная кольцевая автомобильная дорога (Московская область), 239 км | 2021-07-21 20:36:00 | Тяжёлый | Москва | |
| 15 | 2020.0 | 8.0 | 26.1 | 755.8 | 53.0 | Штиль, безветрие | 0.0 | 0.95 | В темное время суток, освещение включено | {'lat': 55.844982, 'long': 37.424455} | Северо-запад | Южное Тушино | г Москва, ул Туристская, 2 | 2020-08-31 19:10:00 | Легкий | Москва | |
| 17 | 2020.0 | 8.0 | 13.6 | 761.0 | 79.0 | Штиль, безветрие | 0.0 | 0.95 | В темное время суток, освещение включено | {'lat': 55.828775, 'long': 37.530445} | Север | Коптево | г Москва, ул Академическая Б., 29 | 2020-08-20 00:23:00 | Легкий | Москва | |
| 18 | 2020.0 | 8.0 | 17.3 | 763.9 | 88.0 | Штиль, безветрие | 0.0 | 0.25 | Дымка. | В темное время суток, освещение включено | {'lat': 55.833373, 'long': 37.518053} | Север | Коптево | г Москва, ул Коптевская, 73А СТР 1 | 2020-08-08 23:00:00 | Легкий | Москва |
| 26 | 2020.0 | 9.0 | 20.6 | 766.1 | 36.0 | ЮВ | 1.0 | 1.00 | В темное время суток, освещение включено | {'lat': 55.755508, 'long': 37.631865} | Центр | Тверской | г Москва, пл Старая, 6 | 2020-09-04 20:35:00 | Тяжёлый | Москва | |
| 29 | 2020.0 | 9.0 | 11.3 | 758.1 | 92.0 | ЗСЗ | 1.0 | 0.95 | Дымка. | В темное время суток, освещение включено | {'lat': 55.784827, 'long': 37.369952} | Северо-запад | Строгино | г Москва, Московская кольцевая автомобильная дорога (МКАД) внешняя сторона, 62 км | 2020-09-08 23:05:00 | Легкий | Москва |
| 33 | 2020.0 | 9.0 | 6.1 | 764.4 | 85.0 | Штиль, безветрие | 0.0 | 0.00 | В темное время суток, освещение включено | {'lat': 55.830167, 'long': 37.552664} | Север | Тимирязевский | г Москва, ул Тимирязевская, 44 | 2020-09-20 22:50:00 | Легкий | Москва |
| year | month | temperature | atmospheric_pressure | humidity | direction_of_the_wind | wind_speed | cloudiness | weather_conditions | light | point | pogoda_region | region | address | datetime | severity | parent_region | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 35010 | 2020.0 | 11.0 | 2.6 | 770.5 | 68.0 | ЗСЗ | 1.0 | 0.6 | В темное время суток, освещение включено | {'lat': 55.768981, 'long': 37.473797} | Северо-запад | Хорошево-Мневники | г Москва, наб Карамышевская, 34 | 2020-11-10 17:15:00 | Легкий | Москва | |
| 35012 | 2020.0 | 11.0 | 1.2 | 756.9 | 93.0 | Штиль, безветрие | 0.0 | 1.0 | Дымка. | В темное время суток, освещение включено | {'lat': 55.805598, 'long': 37.454077} | Северо-запад | Щукино | г Москва, ул Новощукинская, 7 | 2020-11-23 19:35:00 | Тяжёлый | Москва |
| 35014 | 2020.0 | 11.0 | 0.9 | 773.7 | 79.0 | ВЮВ | 2.0 | 1.0 | Снег с перерывами слабый в срок наблюдения. | В темное время суток, освещение включено | {'lat': 55.800189, 'long': 37.392172} | Северо-запад | Строгино | г Москва, ул Таллинская, 2 | 2020-11-15 18:46:00 | Легкий | Москва |
| 35015 | 2020.0 | 11.0 | 6.4 | 767.1 | 87.0 | В | 1.0 | 1.0 | Дымка. | В темное время суток, освещение включено | {'lat': 55.769742, 'long': 37.485909} | Северо-запад | Хорошево-Мневники | г Москва, наб Карамышевская, 2 1 | 2020-11-02 17:10:00 | Легкий | Москва |
| 35016 | 2020.0 | 11.0 | -1.7 | 769.1 | 77.0 | Ю | 1.0 | 1.0 | В темное время суток, освещение включено | {'lat': 55.864024, 'long': 37.424723} | Северо-запад | Северное Тушино | г Москва, ул Вилиса Лациса, 17 стр. 1 | 2020-11-22 04:50:00 | Легкий | Москва | |
| 35017 | 2020.0 | 11.0 | 6.2 | 763.7 | 73.0 | ЮЮЗ | 1.0 | 0.4 | В темное время суток, освещение включено | {'lat': 55.787234, 'long': 37.479386} | Северо-запад | Хорошево-Мневники | NaN | 2020-11-05 17:20:00 | Легкий | Москва | |
| 35018 | 2020.0 | 11.0 | 0.3 | 772.0 | 90.0 | ЮВ | 2.0 | 1.0 | Снег непрерывный слабый в срок наблюдения. | В темное время суток, освещение включено | {'lat': 55.857009, 'long': 37.342669} | Северо-запад | Митино | г Москва, ул Барышиха, 44 | 2020-11-30 20:00:00 | Легкий | Москва |
| 35019 | 2020.0 | 11.0 | 4.1 | 767.8 | 89.0 | В | 2.0 | 1.0 | Морось незамерзающая непрерывная слабая в срок наблюдения. | В темное время суток, освещение включено | {'lat': 55.850433, 'long': 37.413415} | Северо-запад | Северное Тушино | г Москва, б-р Яна Райниса, 24 к. 1 | 2020-11-01 21:30:00 | Легкий | Москва |
| 35020 | 2020.0 | 11.0 | 1.0 | 766.5 | 80.0 | З | 2.0 | 1.0 | Ливневый снег слабый в срок наблюдения или за последний час. | В темное время суток, освещение включено | {'lat': 55.802372, 'long': 37.407482} | Северо-запад | Строгино | г Москва, б-р Строгинский, 30 | 2020-11-09 21:00:00 | Легкий | Москва |
| 35021 | 2021.0 | 10.0 | 9.0 | 773.4 | 54.0 | ЮЗ | 2.0 | 0.0 | NaN | В темное время суток, освещение включено | {'lat': 55.546103, 'long': 37.585237} | Юго-запад | Южное Бутово | г Москва, ш Варшавское, 196 | 2021-10-09 22:53:00 | Легкий | Москва |